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RAW SEQUENCE LISTING DATE: 11/29/2000 

PATENT APPLICATION: US/09/44 6 f 4 02 TIME: 13:41:09 

Input Set : A:\Lucyl.app 

Output Set: N:\CRF3\11292000\I446402.raw 

SEQUENCE LISTING 

4 (1) GENERAL INFORMATION : 

6 (1) APPLICANT: Black Jr., Charles A. 

8 (ii) TITLE OF INVENTION: COMPOSITIONS AND METHODS FOR ACTIVATING 

9 GENES OF INTEREST 
11 (iii) NUMBER OF SEQUENCES: 16 

13 (iv) CORRESPONDENCE ADDRESS: 

14 (A) ADDRESSEE: W. Murray Spruill 

15 (B) STREET: 3605 Glenwood Ave. Suite 310 
.16 (C) CITY: Raleigh 

17 (D) STATE: NC 

18 (E) COUNTRY: US 

19 (F) ZIP: 27622 

21 (v) COMPUTER READABLE FORM: 

2 2 (A) MEDIUM TYPE: Floppy disk 

23 ( B ) COMPUTER: IBM PC compatible 

24 (C) OPERATING SYSTEM: PC -DOS/MS - DOS 

2 5 (D) SOFTWARE: Paten tin Release #1.0, Version #1.30 
27 (vi.) CURRENT APPLICATION DATA: 

C--> 28 (A) APPLICATION NUMBER: US/09/44 6,4 02 

C--> 29 (B) FILING DATE: 20-Dec-1999 

3 0 (C) CLASSIFICATION: 

32 (viii) ATTORNEY/AGENT INFORMATION: 

3 3 (A) NAME: Spruill, W. Murray 

34 (B) REGISTRATION NUMBER: 3 2,94 3 

3 5 (C) REFERENCE/DOCKET NUMBER: 5 7 22 -2 

37 (ix) TELECOMMUNICATION INFORMATION : 

38 (A) TELEPHONE: 919 420 2202 

3 9 ( B ) TELEFAX : 919 8 8.1 317 5 

4 2 (2) INFORMATION FOR SEQ ID NO: 1: 

44 (i) SEQUENCE CHARACTERISTICS: 

45 (A) LENGTH: 4279 base pairs 
4 6 (B) TYPE: nucleic acid 

47 (C) STRANDED MESS : single 

4 8 (D) TOPOLOGY: linear 

50 (ii) MOLECULE TYPE: other nucleic acid 

51 (A) DESCRIPTION: /desc = "Recombinant molecule 

W--> 52 (Multiple Cloning Site/Kozack sequence/LacZ gene)" 

5 5 (ix) FEATURE: 

56 (A) NAME/KEY: misc_feature 

57 (B) LOCATION: 1 . .64 

58 (D) OTHER INFORMATION: /product^ "Multiple cloning Site" 

60 (Ix) FEATURE: 

61 (A) NAME/KEY: misc_feature 

62 { B ) LOCATION: 65 . .79 

63 (D) OTHER INFORMATION: /function^ "Consensus sequence of 

64 translation initiation" 
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PATENT APPLICATION: US/09/4 4 6,4 02 TIME: 13:41:09 

Input Set : A:\Lucyl.app 

Output Set: N:\CRP3\11292000\l44 64 02.raw 

6 5 /product^ "Kozack sequence" 

6 7 (ix) FEATURE: 

58 (A) NAME/KEY: prim_transcript 

69 (B) LOCATION: 80.. 4279 

70 (D) OTHER INFORMATION: /gene= "LacZ" 
7.1 /standard_riame = "Beta galactosidase" 

74 (xi) ' SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

7 6 TT A AT ACQ AC TCACTATAGG CTAGCCTCGA GAATTCACGC GTGGTACCTC TAGAGTCGAC 6 0 

7 8 CCGGGCCGCC GCCACCATGG CGCAGCACCA TGGCCTGAAA TAACCTCTGA AAGAGGAACT 120 

8 0 TGGTTAGGTA CCTTCTGAGG CGGAAAGAAC CAGCTGTGGA ATGTGTGTCA GTTAGGGTGT 18 0 
8 2 GGAAAGTCCC CAGGCTCCCC AG C AG G C A G A AGTATGCAAA GCATGCATCT CAATTAGTCA 24 0 
84 GCAACCAGGT GTGGAAAGTC CCCAGGCTCC CCAGCAGGCA G A AG TAT G C A AAGCATGCAT 300 
86 CTCAATT AGT CAGCAACCAT AGTCCCGCCC CTAACTCCGC CCATCCCGCC CCTAACTCCG 360 

8 8 CCC7iGTTCCG CCCATTCTCC GCCCCATGGC TGACTAATTT TTTTTATTTA TGCAGAGGCC 42 0 

9 0 GAGGCCGCCT CGGCCTCTGA GCTATTCCAG AAGTAGTGAG GAGGCTTTTT TGGAGGCCTA 480 
9 2 GGCTTTTGCA AAAAGCTTGG GATCTCTATA ATCTCGCGCA ACCTATTTTC CCCTCGAACA 540 
9 4 CTTTTTAAGC CGTAGATAAA CAGGCTGGGA CACTTCACAT GAGCGAAAAA TACATCGTCA 600 
9 6 CCTGGGACAT GTTGCAGATC CATGCACGTA AACTCGCAAG CCGACTGATG CCTTCTGAAC 660 

9 8 AATGGAAAGG CATTATTGCC GTAAGCCGTG GCGGTCTGGT ACCGGTGGGT GAAGACCAGA 7 20 
100 AACAGCACCT CGAACTGAGC CGCGATATTG CCCAGCGTTT CAACGCGCTG TATGGCGAGA 7 80 
102 TCGATCCCGT CGTTTTACAA CGTCGTGACT GGGAAAACCC TGGCGTTACC CAACTTAATC 8 40 
104 GCCTTGCAGC ACATCCCCCT TTCGCCAGCT GGCGIAATAG CGAAGAGGCC CGCACCGATC 9 00 
.1.0 6 GCCCTTCCCA ACAGTTGCGC AGCCTGAATG GCGAATGGCG CTTTGCCTGG TTTCCGGCAC 9 60 

10 8 CAGAAGCGGT GCCGGAAAGC TGGCTGGAGT GCGATCTTCC TGAGGCCGAT ACTGTCGTCG 10 2 0 
110 TCCCCTCAAA CTGGCAGATG CACGGTTACG ATGCGCCCAT CTACACCAAC GTAACCTATC 1080 
112 CCATTACGGT CAATCCGCCG TTTGTTCCCA CGGAGAATCC GACGGGTTGT TACTCGCTCA 114 0 
114 CATTTAATGT TGATGAAAGC TGGCTACAGG AAGGCCAGAC GCGAATTATT TTTGATGGCG 12 00 
.116 TTAACTCGGC GTTTCATCTG TGGTGCAACG GGCGCTGGGT CGGTTACGGC CAGGACAGTC 12 60 
118 GTTTGCCGTC TGAATTTGAC CTGAGCGCAT TTTTACGCGC CGGAGAAAAC CGCCTCGCGG 1320 
120 TGATGGTGCT GCGTTGGAGT GACGGCAGTT ATCTGGAAGA TCAGGATATG TGGCGGATGA 13 80 
122 GCGGCATTTT CCGTGACGTC TCGTTGCTGC AT AA AC CG AC TACACAAATC AGCGATTTCC 14 4 0 
124 ATGTTGCCAC TCGCTTTAAT GATGATTTCA GCCGCGCTGT ACTGGAGGCT GAAGTTCAGA 1500 

12 6 TGTGCGGCGA GTTGCGTGAC TACCTACGGG TAACAGTT'TC TTTATGGCAG GGTGAAACGC 1560 
128 AGGTCGCCAG CGGCACCGCG CCTTTCGGCG GTGAAATTAT CGATGAGCGT GGTGGTTATG 16 20 
.13 0 CCGATCGCGT CACACTACGT CTGAACGTCG AAAACCCGAA ACTGTGGAGC GCCGAAATCC 16 80 

13 2 CGAATCTCTA TCGTGCGGTG GTTGAACTGC ACACCGCCGA CGGCACGCTG ATTGAAGCAG 17 4 0 
134 AAGCCTGCGA TGTCGGTTTC CGCGAGGTGC GGATTGAAAA TGGTCTGCTG CTGCTGAACG 1800 
136 GCAAGCCGTT GCTGATTCGA GGCGTTAACC GTCACGAGCA TCATCCTCTG CATGGTCAGG 18 60 
138 TCATGGATGA GCAGACGATG GTGCAGGATA TCCTGCTGAT GAAGCAGAAC AACTTTAACG 1920 

14 0 CCGTGCGCTG T ICG CAT TAT CCGAACCATC CGCTGIGGTA CACGCTGTGC GACCGCTACG 19 8 0 
14 2 GCCTGTATGT GGTGGATGAA GCCAATATTG AAACCCACGG CATGGTGCCA ATGAATCGTC 2 04 0 
144 TGACCCATGA TCCGCGCTGG CTACCGGCGA TGAGCGAACG CGTAACGCGA ATGGTGCAGC 2100 
14 6 GCGATCGTAA TCACCCGAGT GTGATCATCT GGTCGCTGGG GAATGAATCA GGCCACGGCG 216 0 
148 CTAATCACGA CGCGCTGTAT CGCTGGATCA AATCTGTCGA TCCTTCCCGC CCGGTGCAGT 22 20 
150 ATGAAGGCGG CGGAGCCGAC ACCACGGCCA CCGATATTAT TTGCCCGATG TACGCGCGCG 2 2 80 
152 TGGATGAAGA CCAGCCCTTC CCGGCTGTGC CGAAATGGTC CATC AAA AAA TGGCTTTCGC 2 34 0 
154 TACCTGGAGA GACGCGCCCG CTGATCCTTT GCGAATACGC CCACGCCATG GGTAACAGTC 24 00 
156 TTGGCGGTTT CGCTAAATAC TGGCAGGCGT TTCGTCAGTA TCCCCGTTTA CAGGGCGGCT 24 60 
158 TCGTCTGGGA CTGGGTGGAT CAGTCGCTGA TTAAATATGA TGAAAACGGC AACCCGTGGT 2 520 
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160 CGGCTTACGG CGGTGATTTT GGCGATACGC CGAACGATCG CCAG TTCTGT ATGAACGGTC 2 53 0 

16 2 TGGTCTTTGC CGACCGCACG CCGCATCCAG CGCTGACGGA ACCAAAACAC CAGCACCAGT 264 0 
164 TTTTCCAGTT CCGTTTATCC GGGCAAACCA TCGAAGTGAC CAGCGAATAC CTGTTCCGTC 27 00 
166 ATAGCGATAA CGAGCTCCTG CACTGGATGG TGGCGCTGGA TGGTAAGCCG CTGGCAAGCG 2760 
168 GTGAAGTGCC TCTGGATGTC GCTCCACAAG GTAAACAGTT GATTGAACTG CCTGAACTAC 28 20 
170 CGCAGCCGGA GAGCGCCGGG CAACTCTGGC TCACAGTACG CGTAGTGCAA CCGAACGCGA 28 8 0 

17 2 CCGCATGGTC AGAAGCCGGG CACATCAGCG CCTGGCAGCA GTGGCGTCTG GCGGAAAACC 29 4 0 

17 4 TCAGTGTGAC GCTCCCCGCC GCGTCCCACG CCATCCCGCA TCTGACCACC AGCGAAATGG 3000 
176 ATTTTTGCAT CGAGCTGGGT AATAAGCGTT GGCAATTTAA CCGCCAGTCA GGCTTTCTTT 3060 
178 CACAGATGTG GATTGGCGAT AAAAAA.CAAC TGCTGACGCC GCTGCGCGAT CAGTTCACCC 312 0 
180 GTGCACCGCT GGATAACGAC ATTGGCGTAA GTGAAGCGAC CCGCA'TTGAC CCTAACGCCT 3180 

18 2 GGGTCGAACG CTGGAAGGCG GCGGGCCATT ACCAGGCCGA AGCAGCGTTG TTGCAGTGCA 3 24 0 
184 CGGCAGATAC ACTTGCTGAT GCGGTGCTGA TTACGACCGC TCACGCGTGG CAGCATCAGG 3 3 00 
1.8 6 GGAAAACCTT ATTTA'TCAGC CGGAAAACCT ACCGGATTGA TGGTAGTGGT CAAATGGCGA 3360 
188 TTACCGTTGA TGTTGAAGTG GCGAGCGATA CACCGCATCC GGCGCGGATT GGCCTGAACT 34 20 
190 GCCAGCTGGC GCAGGTAGCA GAGCGGGTAA ACTGGCTCGG ATTAGGGCCG CAAGAAAACT 34 8 0 
192 ATCCCGACCG CCTTACTGCC GCCTGTTTTG ACCGCTGGGA TCTGCCATTG TCAGACATGT 3 54 0 

19 4 ATACCCCGTA CGTCTTCCCG AGCGAAAACG GTCTGCGCTG CGGGACGCGC GAATTGAATT 3 600 
19 6 ATGGCCCACA CCAGTGGCGC GGCGACTTCC AGTTCAACAT CAGCCGCTAC AGTCAACAGC 36 60 
19 8 AACTGATGGA AACCAGCCAT CGCCATCTGC TGCACGCGGA AGAAGGCACA TGGCTGAATA 3 7 20 
2 00 TCGACGGT'TT CCATATGGGG ATTGGTGGCG ACGACTCCTG GAGCCCGTCA GTATCGGCGG 3 7 80 
2 02 AATTCCAGCT GAGCGCCGGT CGCTACCATT ACCAGTTGGT CTGGTGTCAA A A AT A AT A AT 3 8 40 
2 04 AACCGGGCAG GCCATGTCTG CCCGTATTTC GCGTAAGGAA ATCCATTATG TAC:TATTTAA 3 9 00 
206 AAA AC AC AAA CTTTTGGATG TTCGGTTTAT TCTTTTTCTT TTACTTTTTT ATCATGGGAG 3 9 60 
208 CCTACTTCCC GTTTTTCCCG ATTTGGCTAC ATGACATCAA CCATATCAGC AAAAGTGATA 4 020 
210 CGGGTATTAT TTTTGCCGCT ATTTCTCTGT TCTCGCTATT ATTCCAACCG CTGTTTGGTC 408 0 
212 TGCTTTCTGA CAAACTCGGA ACTTGTTTAT TGCAGCTTAT AATGGTTACA AATAAAGCAA 414 0 
214 TAG CATC AC A AA'TTTCACAA ATAAAGCATT TTTTTCACTG CATTCTAGTT GTGGTTTGTC 4 2 00 
216 CAAACTCATC AATGTATCTT ATCATGTCTG GATCCTCTAG AGTCGACCTG CAGGCATGCA 4260 
218 AGCTGGCACT GGCCGTCGT 4279 
220 (2) INFORMATION FOR SEQ ID NO: 2: 

222 (i) SEQUENCE CHARACTERISTICS: 

223 (A) LENGTH: 20 base pairs 

224 (B) TYPE: nucleic acid 

225 (C) STRANDEDNESS: single 
2 26 (D) TOPOLOGY: linear 

228 (ii) MOLECULE TYPE: other nucleic acid 

229 (A) DESCRIPTION: /desc = "Synthetic oligonucleotide" 
23 4 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

23 6 GAATACAAAG CITATGCATG 20 
238 (2) INFORMATION FOR SEQ ID NO: 3: 

24 0 (i) SEQUENCE CHARACTERISTICS: 

241 (A) LENGTH: 13 base pairs 

242 <B) TYPE: nucleic acid 
24 3 <C) STRANDEDNESS: single 
244 (D) TOPOLOGY: linear 

246 (ii) MOLECULE TYPE: other nucleic acid 

247 (A) DESCRIPTION: /desc = "Synthetic oligonucleotide" 
252 (xi.) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
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254 GAATACAAAG CTT 13 
256 (2) INFORMATION FOR SEQ ID NO: 4: 
253 (i) SEQUENCE CHARACTERISTICS: 

259 (A) LENGTH: 20 base pairs 

260 (B) TYPE: nucleic acid 

261 (C) STRANDEDNESS : single 

2 62 (D) TOPOLOGY: linear 

264 (ii) MOLECULE TYPE: other nucleic acid 

265 (A) DESCRIPTION: /desc = "Synthetic oligonucleotide" 
270 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

27 2 AAAGCTTATG CATGCGGCCG 20 

274 (2) INFORMATION FOR SEO ID NO: 5: 

276 (1) SEQUENCE CHARACTERISTICS: 

277 (A) LENGTH: 20 base pairs 

278 (B) TYPE: nucleic acid 

279 (C) STRANDEDNESS: single 

280 (D) TOPOLOGY : linear 

282 (ii) MOLECULE TYPE: other nucleic acid 

283 (A) DESCRIPTION: /desc = "Synthetic oligonucleotide" 
288 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

29 0 CGGCCGCATC TAGAGGGCCC 20 

29 2 (2) INFORMATION FOR SEQ ID NO: 6: 

294 (i) SEQUENCE CHARACTERISTICS: 

29 5 (A) LENGTH: 25 base pairs 

296 (B) TYPE: nucleic acid 

297 (C) STRANDEDNESS: single 
29 8 (D) TOPOLOGY: linear 

300 (ii) MOLECULE TYPE: other nucleic acid 

3 01 (A) DESCRIPTION: /desc = "Synthetic oligonucleotide" 
3 06 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

308 GCGGCCGCAT CTAGAGGGCC CGGAT 2 5 

310 (2) INFORMATION FOR SEQ ID NO: 7: 

312 (i) SEQUENCE CHARACTERISTICS: 

313 (A) LENGTH: 24 base pairs 

314 (B) TYPE: nucleic acid 

315 (C) STRANDEDNESS: single 

316 (D) TOPOLOGY: linear 

318 (ii) MOLECULE TYPE: oLher nucleic acid 

319 (A) DESCRIPTION: /desc - "Synthetic oligonucleotide" 
324 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

326 AATACAAAGC TTATGCATGC GGCC 24 

328 (2) INFORMATION FOR SEQ ID NO: 8: 

33 0 (i) SEQUENCE CHARACTERISTICS: 

331 (A) LENGTH: 3 0 base pairs 

3 32 (B) TYPE: nuclei c acid 

333 (C) STRANDEDNESS: single 

3 34 (D) TOPOLOGY: linear 

3 36 (ii) MOLECULE TYPE: other nucleic acid 

3 37 (A) DESCRIPTION: /desc - "Synthetic oligonucleotide" 
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342 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

34 4 AATACAAAGC TTATGCATGC GGCCGCATCT 30 

34 6 (2) INFORMATION FOR SEQ ID NO: 9: 

34 3 (i.) SEQUENCE CHARACTERISTICS: 

34 9 (A) LENGTH: 2 0 base pairs 

35 0 (B) TYPE: nucleic acid 
351 ( C ) STRANDEDNESS : single 
3 5 2 ( D ) TO PO LOGY : linear 

3 54 (ii) MOLECULE TYPE: other nucleic acid 

3 55 (A) DESCRIPTION: /desc = "Synthetic oligonucleotide" 

360 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

362 CATGCATAAG CTTTGTATTC 2 0 

36 4 (2) INFORMATION FOR SEQ ID NO: 10: 
366 (i) SEQUENCE CHARACTERISTICS : 

36 7 (A) LENGTH: 13 base pairs 

3 68 (B) TYPE: nucleic acid 

369 <C) STRANDED N ES S : single 

370 (D) TOPOLOGY: linear 

372 (ii) MOLECULE TYPE: other nucleic acid 

373 (A) DESCRIPTION: /desc = "Synthetic oligonucleotide" 

37 8 ( x i. ) S EQU EN CE DESCR I PTI ON: S EQ I D NO : 10: 

380 AAGCTTTGTA TTC 13 

382 (2) INFORMATION FOR SEQ ID NO: 11: 

384 (i) SEQUENCE CHARACTERISTICS: 

385 (A) LENGTH: 20 base pairs 

386 (B) TYPE: nucleic acid 

38 7 (C) ST RAND ED MESS : single 
388 (D) TOPOLOGY: linear 

390 (ii) MOLECULE TYPE: other nucleic acid 

3<J1 (A} DESCRIPTION: /desc - "Synthetic oligonucleotide" 

39 1> (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

398 CGGCCGCATG CATAAGCTTT 20 

400 (2) INFORMATION FOR SEQ ID NO: 12: 

4 02 (i) SEQUENCE CHARACTERISTICS: 

403 (A) LENGTH: 2 0 base pairs 

404 (B) TYPE: nucleic acid 

405 (C) STRANDEDNESS : single 
4 06 (D) TOPOLOGY: linear 

4 08 (ii) MOLECULE TYPE: other nucleic acid 

409 (A) DESCRIPTION: /desc = "Synthetic oligonucleotide*' 

414 (xl) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

416 GGGCCCTCTA GATGCGGCCG 20 

418 (2) INFORMATION FOR SEQ ID NO: 13: 

4 20 (i) SEQUENCE CHARACTERISTICS : 

421 (A) LENGTH: 2 5 base pairs 

422 (B) TYPE: nucleic acid 

423 (C) STRANDEDNESS: single 
4 24 (D) TOPOLOGY: linear 

426 (ii) MOLECULE TYPE: other nucleic acid 
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VERIFICATION SUMMARY DATE : .11/29/2000 

PATENT APPLICATION: US/09/446,4 02 TIME: 13:41:10 

Input Set : A:\Lucyl.app 

Output Set: N:\CRF3\11292000\I4464 02.raw 

L:28 M:220 C: Keyword misspelled or invalid format, [(A) APPLICATION NUMBER : ] 
L:29 M:220 C: Keyword misspelled or invalid format, [(B) FILING DATE : j 
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